An XML Schema integration and query mechanism system

نویسندگان

  • Sanjay Kumar Madria
  • Kalpdrum Passi
  • Sourav S. Bhowmick
چکیده

The availability of large amounts of heterogeneous distributed web data necessitates the integration of XML data from multiple XML sources for many reasons. For example, currently, there are many e-commerce companies, which offer similar products but use different XML schemas with possibly different ontologies. When any two such companies merge, or make an effort to service customers in cooperation, there is a need for an integrated schema and query mechanism for the interoperability of applications. In applications like comparison-shopping, there is a need for an illusionary centralized homogeneous information system. In this paper, we propose XML Schema integration and querying methodology. We define an object-oriented data model called XSDM (XML Schema Data Model) and present a graphical representation of XML Schema for the purpose of schema integration. We use a three-layered architecture for XML Schema integration. The three layers included are namely pre-integration, comparison and integration. The three layers can conceptually be regarded as three phases of the integration process. During pre-integration, the schemas present in XML Schema notation are read and converted into the XSDM notation. During the comparison phase of integration, correspondences as well as conflicts between elements are identified. During the integration phase, conflict resolution, restructuring and merging of the initial schemas takes place to obtain the global schema. We define integration policies for integrating element definitions as well as their datatypes and attributes. An integrated global schema forms the basis for querying a set of local XML documents. We discuss various strategies for rewriting the global query over the global schema into the sub-queries over local schemas. Their respective local schemas validate the subqueries over the local XML documents. This requires the identification and use of mapping rules and relationships between the local schemas.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Querying Heterogeneous XML Sources through a Conceptual Schema

XML is a widespread W3C standard used by several kinds of applications for data representation and exchange over the web. In the context of a system that provides semantic integration of heterogeneous XML sources, the same information at a semantic level may have different representations in XML. However, the syntax of an XML query depends on the structure of the specific XML source. Therefore,...

متن کامل

Une approche matérialisée basée sur les vues pour l'intégration de documents XML. (A view-based approach to the integration of structured and semi-structured data)

Semi-structured data play an increasing role in the development of the Web through the useof XML. However, the management of semi-structured data poses speci c problems because semi-structured data, contrary to classical databases, do not rely on a prede ned schema. The schemaof a document is contained in the document itself and similar documents may be represented bydi erent sc...

متن کامل

Query rewriting for open XML data integration systems

This paper presents OpenXView, a model for open XML data integration systems, characterized by the autonomy of users that publish XML data on a common topic. Autonomy implies frequent and unpredictable changes to data and a high degree of structure heterogeneity. The OpenXView model provides an original integration schema, based on a hybrid ontology XML schema structure. We propose solutions fo...

متن کامل

Prototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica

Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...

متن کامل

Source Identification and Query Rewriting in Open Xml Data Integration Systems

This paper presents OpenXView, a model for open, large scale XML data integration systems, characterized by the autonomy of users that publish XML data on a common topic. Autonomy implies frequent and unpredictable changes to data and a high degree of structure heterogeneity. OpenXView provides an original integration schema, based on an hybrid ontology XML schema structure model. We propose so...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Data Knowl. Eng.

دوره 65  شماره 

صفحات  -

تاریخ انتشار 2008